Improving the competitiveness of discriminant neural networks in speaker verification
نویسندگان
چکیده
The Artificial Neural Network (ANN) Multilayer Perceptron (MLP) has shown good performance levels as discriminant system in text-independent Speaker Verification (SV) tasks, as shown in our work presented at Eurospeech 2001. In this paper, substantial improvements with regard to that reference architecture are described. Firstly, a new heuristic method for selecting the impostors in the ANN training process is presented, eliminating the random nature of the system behaviour introduced by the traditional random selection. The use of the proposed selection method, together with an improvement in the classification stage based on a selective use of the network outputs to calculate the final sample score, and an optimisation of the MLP learning coefficient, allow an improvement of over 35% with regard to our reference system, reaching a final EER of 13% over the NIST-AHUMADA database. These promising results show that MLP as discriminant system can be competitive with respect to GMM-based SV systems.
منابع مشابه
Factor analysis of mixture of auto-associative neural networks for speaker verification
This paper introduces the theory of factor analysis of the mixture of Auto-Associative Neural Networks (AANNs) with application in speaker verification. First, we formulate the problem of learning a low-dimensional subspace in part of the mixture of AANNs parameter space, and subsequently derive the update equations by minimizing loss function of the mixture. Second, we apply this technique to ...
متن کاملTandem deep features for text-dependent speaker verification
Although deep learning has been successfully used in acoustic modeling of speech recognition, it has not been thoroughly investigated and widely accepted for speaker verification. This paper describes an investigation of using various types of deep features in a Tandem fashion for text-dependent speaker verification. Three types of networks are used to extract deep features: restricted Boltzman...
متن کاملOnline Monitoring and Fault Diagnosis of Multivariate-attribute Process Mean Using Neural Networks and Discriminant Analysis Technique
In some statistical process control applications, the process data are not Normally distributed and characterized by the combination of both variable and attributes quality characteristics. Despite different methods which are proposed separately for monitoring multivariate and multi-attribute processes, only few methods are available in the literature for monitoring multivariate-attribute proce...
متن کاملTime-Contrastive Learning Based Unsupervised DNN Feature Extraction for Speaker Verification
In this paper, we present a time-contrastive learning (TCL) based unsupervised bottleneck (BN) feature extraction method for speech signals with an application to speaker verification. The method exploits the temporal structure of a speech signal and more specifically, it trains deep neural networks (DNNs) to discriminate temporal events obtained by uniformly segmenting the signal without using...
متن کاملImproving Speaker Verification for Reverberant Conditions with Deep Neural Network Dereverberation Processing
We present an improved method for training Deep Neural Networks for dereverberation and show that it can improve performance for the speech processing tasks of speaker verification and speech enhancement. We replicate recently proposed methods for dereverberation using Deep Neural Networks and present our improved method, highlighting important aspects that influence performance. We then experi...
متن کامل